Draft of Nov 4 , 2006 . Description and Search : Metadata as Infrastructure

نویسنده

  • Michael K. Buckland
چکیده

The first and original use of metadata is for describing documents. XML, the Dublin Core and MARC library catalog records are examples. The name “metadata” (beyond or with data) and the popular definition “data about data” is based on this use. A second use of metadata is to form organizing structures by means of which documents can be arranged. These structures can be used both to search for individual documents and also to identify patterns within a population of documents. The second role of metadata involves an inversion of the relationship between document and metadata. These structures can be considered infrastructure. The First Purpose of Metadata: Description The term “metadata” is used to denote “data about data” and its first and original purpose is to describe documents. (Here we do not distinguish between data and documents). There are different kinds of descriptive metadata: technical (to describe format, encoding standards, etc.); administrative (to describe intellectual property rights, conditions of access, etc.); and content (subject matter, scope, authorship, etc.). These descriptions characterize and explain the data. Metadata helps one to understand what the data is and how to make use of it (Caplan 2003, Haynes 2004). Metadata has two components: A format and a set of values. XML, the Dublin Core and MARC library catalog records are well-known formats and are associated with specific standards for specifying the kinds of descriptions that may be used with them. Description can be very useful, even if idiosyncratic terminology is used. Almost any description is better than none. However, it is always strongly recommended that descriptive metadata follow standardized forms, e.g. using a standard format and widely used terminology. The use of standardized formats for storing and displaying makes use of metadata easier. The use of standard vocabularies has the advantage of consistency and aids understanding. All description is a language activity even if an artificial notation, such as the Dewey Decimal Classification system, is used. Description is always and necessarily culturally-based because descriptions are based on the concepts, definitions, and understandings that have developed in a community. When you browse documents, especially digital documents, you are likely to examine descriptive metadata in order to understand what kind of document it is, what it is about, and how to use it. This process resembles the way one can look at the cover of a book to help assess the text within.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrating RDF Querying Capabilities into a Distributed Search Infrastructure

The Semantic Web is inherently distributed, and covers both metadata and full-text information. Semantic search therefore can profit a lot from peer-to-peer infrastructures as well as from powerful metadata search functionalities based on full-text search technologies. In this paper we focus on an approach extending an existing P2P search infrastructure with RDF querying capabilities, which bot...

متن کامل

بررسی واکنش موتورهای کاوش وب به پیشینه‌های فرادا‌ده‌ای مبتنی برروش ترکیبی داده‌های خرد و روش داده‌های پیوندی

The purpose of this research was to find out the reaction of Web Search Engines to Metadata records created based on the combined method of Rich Snippets and Linked Data. 200 metadata records in two groups (100 records as the control group with the normal structure and, 100 records created based on microdata and implemented in RDF/XML as experimental group) extracted from the information gatewa...

متن کامل

Metadata, semantics, and ontology: providing meaning to information resources

Metadata research has emerged as a new discipline in the last years, and is focused on the provision of semantic descriptions of a diverse kind to digital resources, web resources being the most frequent target. Such associated descriptions are supposed to serve as a foundation for advanced, improved services in several application areas, including search and location, personalisation, and auto...

متن کامل

A Description Infrastructure for Audiovisual Media Processing Systems Based on MPEG-7

We present a case study of establishing a description infrastructure for media processing systems. The description infrastructure consists of an internal metadata model and access tools for using it. Based on an analysis of requirements, we selected, out of a set of candidates, MPEG-7 as the basis of our metadata model. The openness and generality of MPEG-7 allow using it in a broad range of ap...

متن کامل

An Infrastructure for Building Semantic Web Portals

One important task of semantic web portals is to offer both end users and applications a seamless access to knowledge contained in heterogeneous data sources in specific user communities. As such it is important to ensure that i) high quality metadata are extracted from heterogeneous sources in an automated manner and ii) comprehensive querying facilities are provided thus enabling knowledge to...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006